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THE OSCILLATORY DISTRIBUTION OF DISTANCES IN 
RANDOM TRIES 

By Costas A. Christophi and Hosam M. Mahmoud 

George Washington University 

We investigate A n , the distance between randomly selected pairs 
of nodes among n keys in a random trie, which is a kind of digital 
tree. Analytical techniques, such as the Mellin transform and an ex- 
cursion between poissonization and depoissonization, capture small 
fluctuations in the mean and variance of these random distances. The 
mean increases logarithmically in the number of keys, but curiously 
enough the variance remains 0(1), as n — > oo. It is demonstrated that 
the centered random variable = A n — [2 log 2 nj does not have a 
limit distribution, but rather oscillates between two distributions. 

1. Introduction. In this computer age that we are living in, digital data, 
which are represented and accessed via their composition into digits, are 
quite ubiquitous in science and technology. They are prevalent in computer 
files, telecommunication signals, DNA and so forth. Digital searching deals 
with the storage of information in a computer's memory and its fast recovery 
[9]. Various forms of digital trees are known to be super efficient for this 
purpose. The subject of this investigation is the trie, one popular such digital 
technology, initially proposed independently by De La Briandais [2] and 
Fredkin [4] for information retrieval. 

In informatics, distances between nodes in a random combinatorial object 
are of prime interest, because they are indicative of the speed of communica- 
tion within the structure. These distances have applications in many other 
fields too. For example, the collective sum of all such distances in the graph 
underlying a molecule is known in chemistry as the Wiener index (see [5] 
and [19]). 

In this paper we look at the distances between distinct pairs of keys in 
random binary tries. The mean and variance and ultimately the asymptotic 
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distribution (via its moment generating function) are derived by analytical 
methods involving the use of poissonization, as a mathematical transform, 
and of depoissonization, as an asymptotic inverse transform. Although the 
chief interest lies in studying the random tree for a fixed population of n 
keys, the recurrence equations involved are rather unwieldy. If a Poisson 
number of keys is assumed instead, the functional equations involved can 
asymptotically be solved by the Mellin transform and its inverse. It can 
then be justified that the solution is a good approximation (with quantifiable 
small errors) for the fixed-population problem, when the Poisson parameter 
is taken to be n, as n — > oo. 

Analogous studies have been conducted on binary search trees (a different 
model of random trees), as, for example, the study of the Wiener index [14], 
and the study of random distances [13] and its generalization to spanning 
trees of subsets of nodes [15]. However, the methodology for binary search 
trees is essentially different from the analytic tool-kit we employ here for 
tries. 

Assume a standard Bernoulli probability model on data. Let A n be the 
distance between two randomly chosen keys in a trie. Two of our results 
concern the mean and variance, and the periodic fluctuations therein (the 
notation lg stands for log 2 ): 



where a ~ 7.227113 is a constant explicitly found as a numerical series, 
and r)(-) and £(•) are bounded functions of miniscule magnitude which are 
periodic in their argument. The error terms include additional small periodic 
functions as well. Note that the variance is O(l), a highly desirable feature in 
a data structure; this property indicates robustness of random digital trees 
in spite of the large variation in the data. 

In view of the O(l) variance, one expects that if a nontrivial limit dis- 
tribution were to exist, it would require only centering, but no scaling. Our 
main result is that a centered integer version of A n does not converge in dis- 
tribution to any random variable. Rather its distribution function assumes 
the form of a discrete staircase distribution which oscillates between two 
distinct fixed discrete distribution functions. 

The main results have been sketched. The rest of the paper is organized 
in sections as follows. In Section 2 the definition of a trie is given and illus- 
trated by an example, and the associated trie terminology is made precise. 
The probability distribution assumed on the data is also discussed. At the 
end of Section 2, notation that is used as a working language throughout 
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is explained. The moments are discussed in Section 3 and the moment gen- 
erating function is derived in Section 3.1. The poissonization-Mellin- inverse 
Mellin-depoissonization program is illustrated in this context. The residue 
calculation is taken up in Section 3.2. Some of the very lengthy calculations 
are relegated to Appendixes A and B. The first two moments are pumped out 
from the Mellin transform of the moment generating function. This is done 
thoroughly for the mean in Section 3.3 and only sketched for the variance 
in Section 3.4. The oscillating nature of the moment generating function 
elicits the nonexistence of a limit distribution. This is shown in Section 3.5 
by demonstrating explicit fluctuations in the moment generating function. 
Section 4 concludes the paper with an interpretation of the results and a 
discussion of their scope. 

2. Tries. A trie is a digital tree structure consisting of internal nodes that 
have one or two children, and leaf nodes that hold data items (commonly 
called keys). We shall assume our data to be binary (although our methods 
should work well for larger alphabets too). Suppose we have n > keys given 
in their dyadic representation. Data are scaled to be in the interval [0,1]. 
The trie grows by the progressive insertion of keys. If n = 0, nothing needs 
to be done; the insertion algorithm terminates. If n = 1, a leaf is allocated 
for the single key given. If n > 2, an internal node is allocated as a root of the 
tree; keys with as their most significant bit go to the left subtree, and keys 
with 1 to the right. Subsequently, in the subtrees the insertion algorithm 
is applied recursively, using the (£ + l)st most significant bit of the key for 
branching at level I. At termination, each key is stored in a leaf by itself, and 
the root-to-leaf paths in the tree correspond to minimal prefixes sufficient 
to distinguish the keys. As an illustration, suppose that n = 5 and that the 
data are the following: 





= 0.00111 


x 2 


= 0.11011 


A 3 


= 0.00011 


A 4 


= 0.01010 


a 5 


= 0.11111 



The algorithm guides Xi,X$ and A4 to the left subtree, and it guides X2 
and A5 to the right one. Then, in the left subtree X\ and A3 go to the left 
but A4 goes to the right, whereas in the right subtree, both X2 and A5 go 
to the right. The subtree containing X\ and A3 is developed further into a 
left subtree with A3 being a leaf node as it is the only node in the subtree, 
and a right subtree with Ai contained in a leaf node as it is the only node 
in the subtree, and so on. The resulting trie is depicted in Figure 1, with 
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Fig. 1. A trie with 5 keys. 



bullets indicating internal nodes and squares indicating leaf nodes, where 
the keys are stored. 

We assume a standard model of randomness, according to which keys 
are independent and of infinite length; within each the bits are independent 
and equiprobable. Thus, each key can be viewed as an infinite sequence of 
Bernoulli trials. This probability model is often called the unbiased Bernoulli 
model. 

We define A n as the distance (i.e., the number of tree edges) between two 
randomly selected keys in a random trie of size n, with random meaning 
that all (2) pairs of keys are equally likely choices. For example, in Figure 1 
the distance between X2 and X4 is 5. A recursive formulation for A w goes 
via 6 n , the depth of a randomly selected key in a random trie of size n. 

Furthermore, the Mellin transform of a function fix) is 

/ f(x)x s - 1 dx, 
Jo 

and will be denoted by Mellin{/(x); s} or, interchangeably, by f*(s). The 
Mellin transform usually exists in vertical strips in the s complex plane of 
the form 

a < "Sts < b, 

for real numbers a <b. We shall denote such a domain of existence by (a, b) . 
The function f(x) can be recovered from its transform by a line integral 

1 rc+ico 

f(x) = — r(s)x- s d S , 

ZTTl J c—ioc 

for any c G (a, b). 

3. Moments of the random distance. Let L n and R n be, respectively, 
the number of keys residing in the left and right subtrees, among the n keys 
received by the tree (so, L n + R n = n). In view of the unbiased Bernoulli 
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model, L n and R n are both distributed like Bin(n, ^), a binomially dis- 
tributed random variable on n independent trials and rate of success ^ per 
trial. Because of the probability distribution of the keys and the recursive 
action of the insertion algorithm, each of the left and the right subtree pre- 
serves the probabilistic structure of the trie. The only difference is that they 
are random tries constructed on L n and R n keys, respectively, instead of n. 

Given L n , A n can be Ai n with probability ( L £) / Q) when both keys come 
from the left subtree, Ar u with probability ( R £) / when both keys come 
from the right subtree or (5i n + 1) + (Sr„ + 1) with probability L n R n / 
when the keys come from different subtrees [here a tilded random variable 
indicates an independent copy (with the same distribution) of that random 
variable]. Hence, we have the conditional distribution 



(1) A n \L n = < 



(«5 Ln + l) + (^ n + l), 



with probability 



with probability 



fir) 
(2) 
(t) 



(2) ' 

with probability " n , 



where the vector (Ai,...,A n ) denotes a copy of random distances inde- 
pendent of the vector (Ai, . . . , A n ); likewise, (Si,..., S n ) and (8\,. .. , 5 n ) are 
vectors of independent copies of random depths. It is important to note 
that Ax, n and A^ n are dependent through the dependency of L n and R n , 
but given the value of L n (and, hence, R n ), the two are conditionally inde- 
pendent; the same applies to 5l„ and Sji n . 



3.1. The moment generating function. From the conditional recursion (1), 
we have a recurrence for the conditional moment generating function: 



E[e Ant \L n ] = e AL " t 



L 



_|_ e ^ & L n +l) + ^R n +^))tl n H n _ 



Taking expectation of both sides, we essentially get a recurrence for the 
moment generating function: 



E[e Allt ] =E 



L, 



+ E 



D A Rn t I Rn 



+ E[e 



(fen+l) + (^n+l))* 



L n Rn] 



We observe the symmetry of the left and the right subtrees, and the fact 
that L n is distributed like Bin(n, ^). After multiplying by z n and summing 
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over all possible values of n, we get 



oo 
n=0 



nl 



;i=0f=0 



3 ((5£ + l)t. 



E 



n 



(n-£)\ 



1 

2™ ' 



where for any generic random variable X, 4>x{t) '■= E[e tx ]. This gives 



(2) 



<D(M) = 2<I>(t,-)e*/ 2 + £E 

lj=0 



a («i+i)*2. 



)<M,(*) 



where $(t,z) :=£^o^'^ ■ 

Note that e~ z &(t,z) has a poissonization interpretation. For, if N(z) is 
distributed like a Poisson random variable with mean z, then 



3=0 

= E 



M)P(N(z)=j) 



N(z) 
2 



g^JV(z)* 



Averaging (^je^^W* is just like averaging {^)e^ nt , except that we assume 
a Poisson number of keys instead of a fixed population n. As we shall see, 
the advantage of poissonization is that the Poisson model is amenable to 
the formulation of functional equations that can be asymptotically solved 
by the Mellin transform techniques. The various fixed-population problems 
are averaged over cases with Poisson probabilities as weights. The fixed- 
population asymptotics can be well approximated by their depoissonized 
counterparts. 

When we multiply (2) throughout by e~ z , we get the functional equation 
e- z $(t,z) = 2e- z/2 <S>(t,-) +f(i, 



where F(t,z) := [ e * e -*£°^%^'] 2 . 

We want to study convergence issues of moment generating functions in a 
neighborhood around t = (via the Mellin transform). The Mellin transform 
of the latter functional equation does not exist in such a neighborhood to 
recover the moment generating function. For example, at t = 0, the latter 
equation becomes 

z 2 z 2 z 2 
~2 = T + T' 
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and we cannot take the Mellin transform of the equation because z 2 does 
not have such a transform. Nevertheless, the same equation shifted down by 
z 2 /2 has a Mellin transform. It turns out that for general t, the factor e 2 *^- 
is a suitable shift. Introduce this shift to get 



e- z $(t,z) 
that is, 

(3) 



„2f 



„2t 



{z/2f 



Jit ■ 



+ F t, 



P(t,z) = 2P[t,-)+F[t, 



,2t' 



where P(t, z) := e~ z $(t, z) 



2tz± 
e 2 . 



Lemma 1. The Mellin transform of P(t,z) (with respect to the variable 
z) is 



P*(t,s) 



e 2t 2 s (l-e t ) 2 T(s + 2) 
[1 - 2 s + 1 )(l - e 2t 2 s + 2 ) 



2 s+3 e t 
- e *2 s + 2 



tk 



k=0 



(l + l/(2*+i))' 



+2 



+ 



(1 - 2 S + 3 ) 
2 5 + 2 



+ 



2 s+l e 2t {l-e t )T{s + 2) 
(l-2 s +!)(l-e*2 s + 2 ) ' 



well defined for (—3, —2 — /or |i| < | ln2. 

Proof. The bulk of this computation is in the handling of the function 
F(t,z), which is a squared generating function. We relegate this lengthy 
computation to Appendixes A and B. □ 



Note that, for negative t, the strip (—3, —2 — j 2 ^) includes the entire strip 
(—3,-2), and, for all |<| < ^ln2, the Mellin transform exists in (—3,-2 — 
j|4)- The inverse Mellin transform, 



P(t,z) 



1 

2vri 



c+ioo 



P*(t,s)z" s ds, 



recovers the (shifted) poissonized moment generating function, if we take 

2l 1 1 1 

c £ (—3, —2 — j-Lg) , with \t\ < 2 In 2. The transform P*(t, s) has simple poles 

_ 2+ -2t+2^ for 



at s = —2, at s = 
and j = 0, ±1, . 
s = -3,-4,.... 



-l + ^f,forr = 0,±l, 



at s 



, at s — * , ln2 
2 + for i / and /i = 0, ±1, . . . and at 
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We employ the method of "closing the box." (This method is discussed 
in [12] and [18].) In this method one takes the complex integration over the 
line c — iM and c + iM, and then closes the box connecting the four corners 
c±iM and dztiM, for an arbitrary d > 0. The number M is chosen in such 
a way that no pole is crossed. For example, we can take it to be lliQHiddJ. ; 
for integer m. The residue theorem points out that 

(4) lim f P*(t, s)z~ s ds = 2ni\^ residues in (c, d). 

The contour integral can be written as 

pd-iM pd+iM rc+iM rc—iM 

P*(t,s)z- S ds = / + / + / + / 

Jc-iM Jd-iM Jd+iM J c+iM 

As we let m — > oo (hence, M — > oo), the line integrals at the top and bottom 
sides of the box approach 0, as the magnitude of the T function decreases 
exponentially fast with its imaginary part. Moreover, the integral at the 
right side of the box introduces an error term of the order of 0{z~ d ). Hence, 
(4) gives 

P(t,z) =0{z~ d ) -^residues in (c,d). 
The problem has now been reduced to residue computation. 



3.2. Residue computation. We have the following residue calculations: 

z(l 



Res [P*(t,s)z- 

s=—l 



e l )e 2t 



4(1 -2e 2t ) In 2 




Fig. 2. The poles of the Mellin transform P*(t,s); x indicates a simple pole. The shaded 
area is a domain free of poles, where P*(t,s) is well defined. 



OSCILLATORY DISTRIBUTION IN TRIES 



Res[P*(t,s)z~ 
Res \P*(t,s)z- 

s=-2-t/ln2 



Res \P*(t,s)z' 

s=-2-2t/ln2 



}L J -3e f -l 
^ 2 fe+1 + 1 



fc=0 



1 7 2 2t 
2^ e ' 



^ 2 + 2 */ ln2 r(-2t/ln2)e 3i (e i - 1) 
(1 -2e 2 ')ln2 



x i (e »- e *-2) 



(e*-l)E e 

fc=0 



1-1 + 



\2t/ln2- 



2 fcH 



i?es [P*(t,s)z- S ] = (e*-1) 

s=-l+2nri/ln2 



, ^ 2vrirzr(27rir /In 2)e~ 2lTir ^ z 



(l-2e 2 *)ln 2 2 



e 2t (e t -2) 
1 -2e* 

-e 3 V-l) 



00 

x >T e* fc 

k=0 



Res \P*(t,s)z~ s ]=0, h^O, 

s=-2+(-f+27i7ii)/ln2 

and, for j / 0, 

Res [P*{t,s)z- S ] 

s=-2+(-2t+2irji)/ln2 



1-1 + 



^ \ -l-27rir/ln2- 



z 2+2t/ln2 e -2^1g^ r ^_ 2t + 2 ^- J -)/ ln2 )( e t _ ^ 



(1 -2e 2 *)ln2 



e 2t - e* - 2 



r 



e 3 V"l)E e 

fc=0 



1-1 + 



^ \ (2t-27rij)/ln2- s 



2 fc+l ) 
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Hence, for any arbitrary d > 0, 



P(t,z) 



4(1 -2e 2t ) In 2 



fc=0 

-z 2 e 2 * - 



2 fc+i + i 



g 2+2*/ln2 r (_ 2t / ln 2) e 3«( e *_l) 

(1 -2e 2 *)ln2 



x ^(e 2i -e*-2) 



(e*-l)E e 

fc=0 



tk 



I \2t/ln2 

L + 2^t) 



+ K(t,lgz) + 0(z- d ), 



where 



K(t,lg*) = (e*-1) 

/ 27rijzr(27rij/ln2)e- 27r ^ lg; 
A (l-2e 2 *)ln 2 2 



-El 



e 2t (2-e f ) 
l-2e* 



+ e 3 V-l)E« 

fc=0 



1-1 + 



^ \ -l-27rii/ln2- 



2 fc+l 



; 2+2t/ln2 e -2^iIg 2r ((_2t + 2^j)/ln2) 

(l-2e 2i )ln2 
(e 2t -e t -2 



e 3 *( e *-l)^ e ^ 

fc=0 



1-1 + 



I \(2t-2mj)/ln2 



Recall that P(t, z) = e" z $(t, z) - e 2t %- to get 



E 



^(z)^ Ajv(z)t 
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21 



zie — l)e 
4(1 -2e 2i )ln2 



00 P tk(i _ p t\ 



fc=0 



z 2 + 2t / ln2 r(-2t/ln2)e 3f (e f -l) 
(1 -2e 2 *)ln2 

x{l(e 2t -e'-2)-(e<-l)£e 
l z fc=0 

+ K(t,lgz) + 0(z- d ). 



ft 



\ 2f/ln2- 

L+ 2W) 



We next check the conditions of the depoissonization lemma [7] . For the 
reader's convenience, we state this lemma. 



Lemma 2 ([7]). Let {a n }^=o ^ e a sequence of real numbers. Suppose that 
the poissonized function Viz) = Y^T=q ^~e~ z exists and can be analytically 
continued as an entire function of complex z. Fix 6 G (0,7r/2) and let Sg be 
the cone {z: | argz| < 9}. Suppose that there exist positive constants a < 1, 
Px> $2, c an d Zq such that the following conditions hold simultaneously: 

(i) For all z £ Sg with \z\ > zq, 

\r(z)\</h\z\ e . 

(ii) For all z ^ Sg with \z\ > zq, 

\T(z)e z \ < (3 2 \z\ c e alzl . 



Then for large n, 

a n =V{n)+0{n c ~ 1 l 2 Inn). 



For the rest of the paper we restrict t to the interval [—0.1,0.1]. For any 
fixed t in this range, we can think of <j> n (t) as a sequence of n. It is clear that 
for this range of t, the poissonized function P(t, z) is 

( z 2+2f/ln2) ingide the 

cone Sg, and is 0(|z| 2+2 '/ ln2 e |z| cos(> ) outside that cone. 

By Lemma 2, E[Q)e A "'] is the same as the poissonized version, with n 
replacing z. When we divide by Q), we get a depoissonization correction 
error of the order 0{n 2t ^ n2 ~ l l' 2 lgn) in the moment generating function. 
The Mellin inversion gives an error term of 0(n~ d ) for arbitrary d > 0. 
Thus, combined, the two errors cannot be brought below 0(n 2 */ ln2-1 / 2 lgn). 
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Therefore, we have 



(5) 



where 
u(t 



. a tl 2n 2i / ln2 r(-2t/ln2)e 3i (l - e* 



fl -2e 2 *)ln2 



1 



|-(e 2 '-e*-2) 

oo 

"(e f -l)E e 
/,-=o 



/A' 



i \2t/ln2 

L+ 2W) 



+ 4«(*» 1 g«) + 0(ri 2t/ln2 - 1/2 lgn), 



' lgn) = S (l-2e 2 *)ln2 T { In 2 J 

/e 2 *-e'-2 



e 3 V"l)E- 

fc=0 



j \ (2t-27rij')/ln2- 

L + 2W 



[Note that the rest of the terms of n(t,\gz) are subsumed by the O error 
term.] 

3.3. The mean. It may be possible to determine the moments of A n 
directly from (5). However, the error requires subtle handling owing to the 
presence of O terms, which cannot be differentiated without some regularity 
conditions. It is about the same effort to bypass these regularity conditions, 
and work indirectly from the Mellin transforms of the moments. 

The derivative d k P*(t,s)/dt k , evaluated at t = 0, yields (a shifted ver- 
sion of ) the Mellin transform of the expectation of the poissonized quantity 
( N( 2 z) )A k N(z) . We start with the mean (k = 1). Let e~ z A(z) = E[( N ^)A N(z) ], 

and B(z) = e~ z A(z) — z 2 . Taking the first derivative of P*(t, s) (cf. Lemma 1) 
and evaluating it at t = 0, we get 

p* f ,x_ 2'+ 1 r( 8 + 2) 

K) (l-2 s + 1 )(l-2^+ 2 )' 
well defined for (—3,-2). The inverse Mellin transform is 

1 ,-5/2+ico 2 s+1 r( S + 2) 

B(z) = / --, FTT7 — ToT^ ds. 

V) 2lTi i_5/2-ioo (l-2 s + 1 )(l-2^+ 2 ) 

The transform B*(s) has a double pole at s = —2, which will provide the 
asymptotically dominant term (the major part of the complex integration), 
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and simple poles at s = -1 + ^f, for k = 0, ±1, . . . , at s = -2 + for j = 
±1, . . . , and at s = —3, —4, .... The corresponding graph with the poles of 
B*(s) and the area where the function is well defined is depicted in Figure 3. 

We use again the method of closing the box, as was done for the moment 
generating function, to argue that the inverse Mellin transform of B*(s) is 
given by the sum of the residues of poles lying to the right of the vertical 
line = — | and a correction term of the order 0(z~ d ). 



2 

We have the following residues: 



Res [B*(s)z- S ] = -z 2 lgz + z 



2 , , 2 31n2-2 7 



-2 L w ° 21n2 

z 



and 



E 2 5f% W( s )z- s ]=z 2 V2 (k 

— -2+27rji/ln2 



no- 
where 7 = 0.577215 ... is Euler's constant, 

-2niku 



, , 1 v-^ „ / 2irik\ 



ln2 M0 V ln2 



and 



ln2 ^ V In 2 / 



-2-niju 



Remark. The series in the functions rji and 772 are absolutely conver- 
gent, in view of the fast asymptotic decay of the gamma function T(x + iy), 
for fixed x and increasing y — it is well known that T(x + iy) is 0(y x ~ 1 ' 2 e~ 7ry ' 2 ). 
For instance, for some positive constant AT, we have 



r 



Vhi2 J 



-1/2 



<0(1) + 



0(1) + 



A' 



V2vrln2 



E e 
A 



-7r 2 j/ln2 



(1 _ e -7r2/in2y 2 7rln2' 
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Fig. 3. The poles of the Mellin transform B*(s); x indicates a simple pole and (g> inc 
cates a double pole. The shaded area is the domain where B*(s) is well defined. 



Hence, for any arbitrary d > 0, 

B(z) = z 2 \gz - z 2 rj 2 (lgz) 
2 31n2 - 27 
~ Z 21n2 



+ z 



In 2 



+ '7i(lgz) 



+ 0{z- 



The two functions rj\ and 772 are oscillating functions of a truly ignorable 
magnitude. For all z, the magnitude of r\\ is bounded by 0.1426024772 x 10 -4 
and the mag nitude of 772 by 0.1573158429 x 10 -5 . The oscillations occur m 
the lower-order terms, anyway, and do not make B(z) behave in any unstable 
way. This means that, as a function of z, the oscillations are just "waves" 
of a very small amplitude, fluctuating around a "steady" component. 



Recall that B(z) = e~ z A(z) 



and that A(z) = En=o Q — 



So, 



E 



{ N 2 ] 



A 



N(z) 



z 2 \gz- z 2 r]2(lgz) - z 



2 ln2 - 27 



+ z 



21n2 

+ 0(z- d ). 



The right-hand side in the latter relation is 0(z 2+e ), for any e > 0, and 
as in the case of the moment generating function, one can check that the 
conditions of the depoissonization lemma [7] hold. So, in depoissonized form, 
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the result is 



E 



A, 



n 2 lg n — n 2 7]2 (lg n) — n 



ln2-27 



+ n 



1 

hi2 



+ ??i(lgn) 



21n2 
+ 0(max{n~ 



, n 



3 / 2+£ lnn}), 



for arbitrary e > 0. As d > is also arbitrary in the error term, we can take 
the error to be 0(n 3 / 2+£ ). When we divide by Q), we get 0(n _1 / 2+£ ) error 
term. For ease of exposition, we take e = 0.0001 and present the error in the 
order 0(n -0 ' 4999 ). 



Proposition 1. In a trie of n random keys following the unbiased 
Bernoulli model, the distance A n between two randomly selected keys has 
the mean value 

E[A n ] = 2 lg n + tflg n) - + O (^sss) ' 

where = —2rj2(-), with r^G) being the small oscillating function given 
in (6). 



3.4. The variance. One can continue pumping moments by taking deriva- 
tives of P*(t, s). The derivation of higher moments follows the same general 
principles that were used for the mean: fixed-population recurrence, pois- 
sonization, asymptotic solution by the Mellin transform and its inverse, then 
depoissonization. However, the task becomes too daunting, even for the sec- 
ond moment. This combinatorial explosion phenomenon is common folklore 
in random graphs. Let 

■fN(z) 



L(z) :=E 



V 2 



A 



N(z) 



2z A 



Hence, 



L*(s) 



d 2 P*(t,, 



dt 2 



t=o 



2 s+1 r(s + 2) 
(l_2*+i)(l-2 s + 2 ) 



s+4 



1 - 2 S + 2 



+ 



1-7-2 



=+2 



2 s+2 



fc=0 



(1 + 1/2^+1)^+2 



well defined in (—3,-2). 

The inverse Mellin transform is 

1 



Liz) = 

v ' 2ni 



-5/2+ioo 
5/2-ioo 



L*(s) Z - s ds, 
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which again we handle by shifting the line of integration to the right and 
account for the shift by the residue of the poles introduced. The transform 
L*(s) has a triple pole at s = — 2, which provides the asymptotically domi- 
nant term, double poles at s = — 2 + t^?t, for j = ±1, . . . , and simple poles 



In 2 

at s = — 1 + for k = 0, ±1, . . . and at s ■ 



-3,-4,.... 



We have the following residue contributions: 

= -2z 2 lg 2 z- 2z 2 Igz 



Res \L*(s)z- 



ln2 



In 2 2 



1 5 
-vr 2 + 27 2 - 27ln2 + -ln 2 2 
3 3 



Res [L*{s)z- 

s=— 1 



V Res [L*(a)z- 

rrLs=-2+2irji/hiT 



-21n2^1n(l + pTT 
— (--2? ) 



J2 Res. h \V(s)z- s ) = zt 2 (\gz), 



-l+27rfci/ ln2 



where £i(-) an ^(O are oscillating function given by 



1 



10-2 2 

. fc = 



> / s 1 r>^A 2nik\ 



+ 



■2niku 



(1 + 1/2*H-1)2™?/In2 

4^(27rij'/ln2) 



In 2 



4u , 



3 



2E 

i=o 

and ^(-) is the digamma function. 
Hence, for any arbitrary 9 > 0, 

2 7 



1 



(1 + i/2i+ 1 ) 1 + 2 ^ fc / ln2 



= 2 2 2 lg 2 z + 2zMgz 



In 2 
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In 2 2 



1 5 00 / 

-vr 2 +27 2 - 27ln2 + -ln 2 2-21n2y In 1 + 



k=0 



2 k+l 



ln2 v 2 h l 



2 k ~ 



+ z 2 U\gz)-zU^z)+0{z- 



Recall that L(z) is a shifted poissonized version of the problem, with a 
number of keys N(z) that follows a Poisson distribution with parameter z. 
Therefore, 



E 



N(z) 
2 



A 2 



2z 2 lg 2 z + 2z 2 lgz(^ 



1 



z 



In 2 2 



lvr 2 + 2 7 2 



11 



27 In 2 + — In 2 2 

' 3 



(7) 



21n2]Tln(l + 



2 k+i 



k=0 

— L/^-af-J— 1 

ln2^2 ^1 + 2*+^ 
+ z 2 U\gz)-zU^z) + 0{z~ e ). 

The depoissonization of (7) by the depoissonization lemma [7] finally 
gives us the second moment for the fixed population in the same form 
with n replacing z; the depoissonization introduces a small error of the 
order 0(n~ 4999 ); the genesis of this particular representation of error is 
discussed in the paragraphs preceding Proposition 1. The entire validity 
of depoissonization bears on arguments similar to those discussed in some 
more detail prior to Proposition 1. The variance follows by subtracting the 
square of the mean (cf. Proposition 1). Interestingly, the lg 2 n and lgn terms 
both disappear, leaving behind only a 0(1) function, consisting of a steady 
component and negligible oscillations. 



Theorem 1. In a trie ofn random keys following the unbiased Bernoulli 
model, the distance A n between two randomly selected keys has the variance 



Var[A n ] 



2vr 2 + 19 In 2 2 - 121n2]T;£io H 1 + V 2 



3 In 2 2 

+ £0g«) + 0(7^999), 

where £(•) is an oscillating function given by 

. 16 In 2 + 87 . . A 2 , . A , . . 
= -^(u) - 47 ? 2 (u) - 4£ 3 {u) 
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with r}2{-) given in (6) and given by 




(1 + l/2 fc + 1 ) 2 ^'/ii 



In 2 



2^(27rij/ln2) 
ln~2 



) 



The function £3 in Theorem 1 is absolutely convergent, by arguments 
that are essentially the same as those given for the function 7/1 and 772 that 
appear in the mean; see the remark following (6). In fact, £ offers only 
negligible oscillations; its magnitude is bounded by 0.5654042648 x 10 -4 . 
The arguments in [16] may suggest a way to simplify £. In view of the sharp 
O(l) variance concentration of A n , a concentration law is an immediate 
corollary (by Chebyshev's inequality). 

Corollary 1. As n^> 00, 

lgra 

3.5. The oscillatory distribution. The presence of the oscillating function 
K,(t, lgn) in the moment generating function of A n indicates that a limit 
distribution does not exist. In this section we make this argument more 
explicit. We have determined in Section 3.3 that the mean has 2 lgn leading 
asymptotic term. We have also seen in Section 3.4 that the variance is 0(1). 
One then looks into the behavior of the integer random variable of A* = 
A n — [21gnJ, if there is a chance for convergence. The main result of this 
paper answers this question in the negative. 

Theorem 2. Let A n be the distance between two randomly selected keys 
in a trie of n random keys following the unbiased Bernoulli model. Then 
A n — |_21gnJ has an oscillating moment generating function, and, conse- 
quently, does not converge in distribution to any random variable. 

Proof. One finds from (5) that A* has the moment generating function 



(8) E[e A " t ] = (G(t)+H n (t))e 



,i(21gn-L21gnJ) + q [^±^(2 lg n- [2 lgnj ) 



) 



where 



G{t) 



2r(-2i/ln2)e 3 '(l-e*) 
(l-2e 2 *)ln2 
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and 



_ ^ 2e- 2 ^' ^V-l) / -2t + 27rij \ 
nW "^ (l-2e 2 *)ki2 ^ l n 2 J 



e 2t -e l -2 



CO 

tk 



k=0 



L+ 2^) 



(2t-27rij')/ln2 



Let {#} denote the fractional part of x, that is, 

{x} = x — [x\. 
Using this notation in (8), we find 

E[e A «*] = (G(t) + H n (t))e^ n ^ + O (^e* 21 ^ , 

for the range \t\ < ^ ln 2. For the rest of the proof, we restrict t to the interval 
[—0.1, 0.1]. For any fixed t from this range, the function H n (t) provides small 
ignorable oscillations around G(t), and, of course, the CKn-^lgne^s™}*) 
term can be made arbitrarily small. It is well known that the function {2 lg n} 
is dense in the interval [0,1); see, for example, [10]. The term {21gn} is 
genuinely oscillating. Also, for t > 0, 

inf G{t)e^ n ^ = G{t) and supG(t)e^ lgn >* = e*G(t) 

™>0 n>0 

and, for t < 0, 

inf G(t)e^ 21gn >* = e'G(i) and supG(t)e {21gn}4 = G{t). 

n >0 n>0 

Then, for large n, the moment generating function E[e A ™ 4 ] itself comes in- 
finitely often very close to the two functions G(t) and G(t)e l . These two 
"envelopes" are distinct for all values of t with the exception of that of 
t = 0, where both values are equal to 67(0) = 1. Namely, 

\(G(t) + H n {t))e^ n ^\ > (G(t) - \H n (t)\)e^ n ^\ 

and for nonnegative t in the range of interest, we have 

211 — e*| 



\Hn(t)\ < 



1 -2e 2 *|ln2 
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2t + 2mj \ 



In 2 



< 



xE 

2|1 - e*| 
|1 -2e 2 *|ln2 



e 2 ' - e* - 2 



fc=0 



/A' 



1 + 



2 k+l 



*E 

2|1 - e*| 
1 -2e 2 *|ln2 

<3 x 10" 4 . 



-2t + 2vrij 



In 2 



< 



e 2i - e* - 2 



+ 



e 3t 1 1 — e* 
12 — e*| 



E 



/ -2t + 27TZj \ 

ln2 / 



The latter numerical bound follows from maximizing the part outside the 
sum by standard calculus arguments, and working out a numerical value for 
the sum as was done for the mean [see the remarks following (6)]. Similarly, 
for negative t in the range of interest, 



\H n (t)\ < 



211 - e* 



l-2e 2 *|ln2 



e 2t _ e t_ 2 



+ e 3y _!|£ e tfc 

k=0 



^ \2i/ln2- 

L+ 2W) 



< 



xE 

2 1 1 — e* | 
Il-2e 2 *|ln2 



-2t + 2mj\ 
In 2 ) 



3 2i - e* - 2 



+ e 3*| e t_!|^ e 

fc=0 



< 



*E 

2|1 - e*| 
II - 2e 2 *|ln2 



-2t + 2vrr/\ 
In 2 ) 



e 2t - e* - 2 



+ e 3t )E 



-2t + 2vrij\ 
In 2 J 



< 3 x 10" 



The numerical bound is found in a manner similar to that in the case of 
t > 0. 
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In either case, 3 x 10 4 is an upper bound. Hence, for t > to [where < 
t < 0.1 is the point where (G(t) - 0.0003)e* and G(t) + 0.0003 intersect], 

limsupE[e A «*] > limsup(G(t) - 0.0003)e {21gn} * 

n—*oo n— >oo 

= (G(t) - 0.0003)e*. 

Similarly, 

liminf E^™*] < G(t) + 0.0003. 

n — >oo 

The two bounds are different, as G(t) > 0.98, uniformly in t. For instance, 
letting t = jq gives 

liminf E[e (1/10)A «] < 1.148 < 1.267 < limsupE[e (1/10)A ™]. n 

n— >oo n— »oo 

In principle, one can invert a moment generating function. For integer r, 
we have 

p r (A; = r) = — e- iur E[e iuA "]du. 

27T Jo 

The integrals may not be easy to work through. For example, for r = 0, we 
have 

Pr(A; = 0) = — G(iu)e {2lsn}iu du + — [** H n {iu)e {21sn}iu du 

2"7T Jo 27T Jo 




The functions G(-) and H n {-) are complicated, but it is clear that Pr(A* = 
0) is oscillating. 

4. Discussion. Tries offer many applications in different areas, such as 
computer science, telecommunications and computational biology. Applica- 
tions include a variety of pattern matching algorithms on words, as well as a 
model for analyzing the periodicities and autocorrelation between the sub- 
strings of a string. This is an area of interest to the study of DNA sequences 
(digital data on an alphabet of four letters) in computational biology, when 
the sequences are almost random. 

By purely analytic methods, we investigated distances between pairs of 
keys in a random trie on n keys. The average value is 2\gn (modulated 
by some small oscillations), the variance is 0(1), and the centered distance 
does not have a limit distribution, but rather oscillates between two extremal 
values. The nonexistence of limits in the context of digital trees has been 
noted before in [3], who studied the height of random incomplete tries by 
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a mix of probabilistic and analytic methods. The probabilistic behavior of 
other related parameters can be obtained from our results. For example, it 
is a corollary to our results that the Wiener index 



the sum of all pairs of distances (Dy is the distance between the ith and 
jth keys), has an average value equal to 



To put this result in appropriate perspective, compare with [14]. 

A trie is said to be complete when it is fully grown at each level (except 
possibly the last one) . All the levels in a complete trie are filled with internal 
nodes, except the last two (or one) where the keys reside. Thus, in such a 
complete trie, the keys are at distance [lgnj or 1 + [lgnj. This balance is 
usually a desired feature for the fast retrieval of data on average. In the 
complete trie, if pairs of leaves are chosen at random, both will always be at 
depth lgn + 0(l). It can be shown by a recursive argument that the distance 
between a pair of keys has an asymptotic average 21gn and variance O(l). 
Thus, our results confirm that the trie is an excellent choice for random 
digital data; it tends to naturally balance itself, behaving rather closely as 
an ideally balanced complete trie. 

It appears to us that the results can be extended without introducing 
essential difficulty to the case of data on alphabets larger than binary, which 
can be useful for DNA studies. The problem that can possibly be of different 
structure, giving an interesting offshoot, is that where the Bernoulli model is 
biased. In the binary case, for example, one would consider an ergodic source 
emitting bits of l's with probability p i, and O's with probability q = 
1—p^ | . Several symmetries employed in the balanced Bernoulli case cease 
to exist. Under a biased Bernoulli model, the functional equations obtained, 
both under a fixed-population model and a poissonized data model, will be 
markedly different. We leave this as a topic for future research. 



A Mellin viewpoint of the random depth. There are several known re- 
sults for the random depth derived by probabilistic methods [see [1] for exact 
analysis]. See also the related studies of asymptotics in [6, 8, 17]; these re- 
sults are surveyed in [11]. However, we need the Mellin transform viewpoint 
of these results to plug in the functional equations arising for A n . 

We start from 



W n = Y,D. 
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with probability — , 

n 



with probability — . 

n 
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As in the formulation of the functional equation (2) for A n , we use the 
symmetry of the left and the right subtrees and the fact that L n is distributed 
like Bin(n, ^). After multiplying by z n and summing over all possible values 
of n, we get 

where ^(t,z) := ^ ~fr ' ■ Multiplying further by e~ z to poissonize, we 
get 

e-*V(t, z) = 2e t e- z/2 ^ (t, + ze~ z {\ - e*). 

We then introduce a shift of z (to ensure the existence of the Mellin trans- 
form) to get 



e~ z ^(t,z) - z = 2e t 
that is, 



2 2 



+ ze t - z + ze z (l -e*), 



(9) Q(t, z) = 2e l Q It, -J - z(l - e~*)(l - e*), 

where Q(t, z) := e~ 2: ^'(t, z) — z. 

One can recover the asymptotic result of Jacquet and Regnier [6] or Pittel 
[17] by completing a Mellin inversion and a depoissonization program. How- 
ever, our purpose is specific to the task of developing a functional equation 
for 

8(t,z) :=Q 2 (t,z), 
as we need its transform in P*(t,s). Hence, square (9) above to get 

Q 2 {t,z) = Ae 2t Q 2 (t^ 



4e'z(l - e~ z ){\ - e*)Q( t, £ ) + z 2 (l - e" 2 ) 2 (l - e') 2 , 

that is, 



Q(t,z) = Ae zt @(t,- 



»(t,|)+z 2 (l-e^) 2 (l-e*) 2 



Therefore, 



e*(t, s) = ie 2t 2 s e*(t, s) - Mellin|4e'z(l - e~ 2 )(l - e*)Q (t, ; s 
+ Mellin{z 2 (l - e~ z ) 2 (l - e') 2 ; s}, 
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which means that 



e*(i, s) = ^-Mellin|4e i z(l - e~ z ){l - e*)Q ft, ; s J 

+ Mellin{z 2 (l - e" z ) 2 (l - e*) 2 ; a}) /(l - e 2 *2 s+2 ) 



APPENDIX B 

Mellin transform of P(t, z). Taking the Mellin transform of (3), we get 
P*(t, s) = 2 s+1 P*{t, s) + Mellin jp^i, - e 2 *^; sj. 



That is, 
(10) 

However, 



P*(M) 



Mellin{F(t, z/2) - e 2 V/4; s} 
1 - 2 S + X 



Ft, 



„2t' 



„2t 



(11) 



e — = e e 
4 



e 2 *e- z 



2 h jl ^ 2 

,e 2 i' 



J2*' 



+ ze~ 2 e 2i 



3=2 



\2 



.24' 



-(1-e"*). 



In order to get the Mellin transform of the first part of the right-hand side 
of (11), we first let 



3=0 



J 



Hence, 



-z/2 



Q 2 (t,0+^(l-e-/ 2 )Q(t,0 



+ ^(l + e- z -2e-^ 2 ), 
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with Q(t, z) = f(t, z) — z. Therefore, for the first part on the right-hand side 
of (11), we have the Mellin transform 



Mellin <^ e 2t e~ z 



(12) 



■i=2 

2*^2 



: s 



MelW e zt Q z [t,- );s 



+ Mellin jze 2 * (1 - e" z/2 )Q (t, ; s 
+ Mellin j^e 2 *(l + e~ z - 2e~ z / 2 ); sj 



Moreover, the relation 



Q(M) = 2e*Q U,- - z(l - e~ z )(l - e*) 



gives, by recursion, 
(13) 
Hence, 



Q(*,i)=-£(2e 

fe=0 



2 fc+l 



( l_ e -/2 fe+1 )(1 _ e t). 



Mellin|4e*z(l - e" z )(l - e*)Q ^, ; sj 

= _ 4e *(l - e ') 2 / (1 - e~ z ) £(2e*)*-iUl - e"^ V dz 
Jo k=o z 

= -2e t (l-e t ) 2 J2 etk / (l-e" z )z s+1 dz 
fc=o i-Jo 

Jo Jo 



e- z e- z ' 2k+1 z s+l dz 



-2e*(l- e*) 2 r(s + 2)£e 

fc=0 



/A' 



-l-(2 



fc+l\s+2 



+ 



(1 + 1/2^ 



1V+2 



In addition, 



Mellin{^(l - e-T(l - eT;s} 

= (l-e') 2 / ^(l-e-^V -1 ^ 



(1-e*) 2 
(1-e 



2(1 - e" z )z s+1 dz- I (1 - e"^)z s+i dz 



-2z\ s+1 



^ 2 (i-2*+ 3 )r( s + 2) 



2 s+2 
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Hence, we have 

'2e*(l-e*) 2 r(s + 2) 



e*(M) 



1 _ e 2t 2 s+2 



x E e 

fc=0 



tk 



.1 _ (2 fe+1 ) s + 2 + 



(l + l/2 fc + 1 ) s + 2 _ 



+ (l-e 



n2 ( l-2*+ 3 )r> + 2) * 

2 s+2 



which means that the Mellin transform of the first part on the right-hand 
side of (12) is 

Mellin \e 2t Q 2 (t,-);s 



e 2t 2 s e*(t,s) 



e 2t 2 s 



2e*(l-e t ) 2 r(s + 2) 



x e tk 

k=0 



■l-(2 



+ 



fc+l\s+2 



+ 



(l + l/2 fc + 1 ) s + 2 _ 

(l-e f ) 2 (l-2 s+3 )r( S + 2) 

2 s+2 



Now, using (13), we note that by similar steps we have 
Mellin jze 2t e- z/2 Q (t, ; s J 

OO -I 

= gi e (fc+2) i(1 _ e<) 



fc=0 



x r - z 2 e- z l 2 {l-e- z ' 2k+1 )z s - l dz 



1 



1 



(l + l/2*)'+2 



= -2 s+1 e 2 *(l - e')r(s + 2) £ e* fc 

fc=o 

Hence, the second Mellin transform on the right-hand side of (12) can be 
calculated as 



Mellin|ze 2 *(l-e-^ 2 )g^,0;sj 
= Mellin jze 2 *Q^,0;sj 
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= 2 s + 1 e 2t (l - e*)r(s + 2) 

1 
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x E e 

fc=0 



2 fc( S +2) , x 



(l + l/2*+i)«+s 

Finally, the third Mellin transform on the right-hand side of (12) is 
Mellinj ^-e 2t (1 + e~ z - 2e"^ 2 ); s J 



.<! ^-e 2t 



[(l- e -*/ 2 )- e -*/2(l_ e -*/2)]; S | 



_2 S+1 )e 2 'r(s + 2). 



Therefore, putting the three pieces together, we obtain the first part of the 
right-hand side of (11) as 

( V fYl ■ I / i \ , v A ~l 2 

lin< ( 



Mellinj e 2 V 2 



e 2t 2 s 



2e t (l-e*) 2 r(s + 2) 



1 _ e 2<2s+2 



(14) 



k=0 



tk 



-l-(2 



fc+l\s+2 



+ 



+ (l-e 



(1 + 1/2^+1)^+2 

.(i-2^+ 3 )r( s + 2)' 



2 S + 2 



+ 2 s+1 e 2 *(l-e*)r(s + 2) 



x E e 

fc=0 



/A- 



2 fc ( s +2) + 1 



(l + l/ 2 fc+l)«+2 



+ Q-2 s+1 ^)e 2 *r(s + 2). 
For the second part of the right-hand side of (11), we start from 



ze 2t e~ z 



ze 2t e-*/ 2 f[t, 



z\ z 



2t-z 



-e e 



ze 2t e- z/2 Q[t,~-) + ~-e zt e 



z\ z 



J2t-z/2 _ *2t 



e e 
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Hence, 

MellinJ ze~ z e 2t 



^ j\ V2 

3=1 J V 



'; S ^Mellin{^ e -^(,|); S 
+ Mellin|ye 2 *e- 2 / 2 ; S | 



-Mellin|ye 2i e-*;sj. 

Dealing with each piece separately and utilizing results from before, we 
obtain the following results for the first two pieces: 



MeUi„{^ ( 4 ,£) ;s } 



-2 s+1 e 2f (l-e*)r(s + 2)^e 

fc=0 



tk 



and 

Mellin 

The last piece gives 



-e 2 *e-^ 2 ; S 



Mellin<! — e 2t e~ z :s 



(l + l/2*)*+2 
j = 2 s + 1 e 2t r(s + 2). 



,2f 



2 6 



e~ z z s+1 dz 



= -e 2t T{s + 2). 

Therefore, the second part on the right-hand side of (11) gives 



MellinJ ze~ z e 2t f] (l)' ;S ) = -^^i 1 " e *) r ( s + 2 ) 



i=2 



fc=0 



tk 



(l + l/2*)«+2 



(15) 



+ 2 s+1 e 2t T(s + 2) - ^e 2t T(s + 2). 



For the third and last part of the right-hand side of (11), we get that 
Mellin je 2 '^(l - e - z );sJ = ^e 2t (I - e~ z )z s+1 dz 



(16) 



1 



e 2 T(s + 2). 



After putting (14), (15) and (16) together and doing some algebra, equa- 
tion (10) results in Lemma 1. 
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